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ABSTRACT 

Using N-body simulations and galaxy formation models, we study the galaxy stellar mass 
correlation and the two-point auto-correlation. The simulations are run with cosmological 
parameters from the WMAP first, third and seven year results, which mainly differ in the 
perturbation amplitude of cr 8 . The stellar mass of galaxies are determined using either a semi- 
analytical galaxy formation model or a simple empirical abundance matching method. Com- 
pared to the SDSS DR7 data at z = and the DEEP2 results at z = 1, we find that the 
predicted galaxy clusterings from the semi-analytical model are higher than the data at small 
scales, regardless of the adopted cosmology. Conversely, the abundance matching method 
predicts good agreement with the data at both z = and z = 1 for high <r$ cosmologies 
(WMAP1 & WMAP7), but the predictions from a low ct 8 cosmology (WMAP3) are signifi- 
cantly lower than the data at z = 0. We find that the excess clustering at small-scales in the 
semi-analytical model mainly arises from satellites in massive haloes, indicating that either 
the star formation is too efficient in low-mass haloes or tidal stripping is too inefficient at 
high redshift. Our results show that galaxy clustering is strongly affected by the models for 
galaxy formation, thus can be used to constrain the baryonic physics. The weak dependence 
of galaxy clustering on cosmological parameters makes it difficult to constrain the WMAP1 
and WMAP7 cosmologies. 

Keywords: methods: analytical -galaxies: mass function -galaxies: formation- cosmology: 
theory - dark matter - large-scales structure of Universe 
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1 INTRODUCTION 

The Cold Dark Matter (CDM) paradigm successfully describes the 
formation of structure in the Universe. In this paradigm, galaxies 
form in the potential wells of dark matter haloes via gas cooling 
and subsequent star formation (White & Rees 1978). Observations, 
such as satellite kinematics (Conroy et al. 2007; More et al. 2009), 
galaxy-galaxy lensing (e.g., Mandelbaum et al. 2006) and galaxy 
clustering (e.g., Zehavi et al. 2005), have shown that the properties 
of galaxies, such as their stellar mass, colour and morphology, are 
closely related to the inferred mass of their host haloes. Thus any 
successful model for galaxy formation must reproduce these obser- 
vations. 

Here we focus on the observed clustering of galaxies and ex- 
plore the effectiveness of this quantity in constraining galaxy for- 
mation models and cosmological parameters as this clustering is 
very sensitive to the host halo mass in which galaxies live. The two- 
point correlation function (2PCF) of galaxies has been accurately 
measured by large surveys, such as the Two Deep Field Galaxy 
Redshift Survey (Colless et al. 2001), the Sloan Digital Sky Survey 
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(York et al. 2000) at z = 0, and the DEEP2 Galaxy Redshift Sur- 
vey at z — 1 (Davis et al. 2003). The 2PCF appears to be a power- 
law over a wide range of scales, and it depends on redshift, lu- 
minosity, colour and morphology of galaxies (Norgerg et al. 2002; 
Coil et al. 2004; Zehavi et al. 2002, 2005; Li et al. 2006). Recently, 
Li & White (2009) extended the traditional 2PCF by measuring the 
stellar mass correlation function from the SDSS DR7 data, the so- 
called stellar mass correlation function (SMCF). This quantity pro- 
vides additional constraint on galaxy formation models, as it de- 
pends on the relative mass of galaxies at given scales. 

One tool often used to study galaxy clustering is Semi- 
Analytical Models (SAMs) of galaxy formation (White & Frenk 
1991). Early studies found that these models predicted cluster- 
ing which marginally agreed with the data (e.g., Kauffmann et al. 
1999). Recent deep surveys, which have included more faint galax- 
ies, have shown that currently SAMs predict too high a clustering 
amplitude at small scales (Weinmann et al. 2006; Li et al. 2007). 
The excess clustering in these studies was primarily due to the 
over-abundance of faint galaxies in the models. The recent model 
of Guo et al. (201 1) removed this over-abundance and was able to 
reproduce the local Stellar Mass Function (SMF) down to very low 
mass end. However, this model still over-predicts the small-scale 
clustering, albeit to a smaller degree. These results indicate that it 
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is not solely abundance of galaxies but their spatial locations which 
lead to a higher clustering at small scales. Guo et al. (2011) sug- 
gested that perhaps a low erg universe might remove this discrep- 
ancy. 

Another way to model galaxy clustering is the Abundance 
Matching Method (AMM, e.g. Vale & Ostriker 2004). This method 
assumes that there is a monotonic relation between a galaxy's stel- 
lar mass and its host halo's mass (or progenitor host halo's mass 
at the time of its accretion for a subhalo containing a satellite 
galaxy), and the stellar mass can be obtained by matching the halo 
(or subhalo) abundance to the observed SMFs. The biggest advan- 
tage of this approach is that the observed SMF is perfectly repro- 
duced. It was found that this simple approach can well reproduce 
the properties of galaxy clustering seen in the SDSS at z = (e.g., 
Moster et al. 2010). Extensions of this simple method by including 
more complicated dependence on galaxy type, redshift, host halo 
mass can be found elsewhere (Wang et al. 2006; Behroozi et al. 
2010; Neistein et al. 2011). 

In this paper, we use the SAM of Kang & van den Bosch 
(2008) (hereafter K08, model I) and the AMM (model II) to 
produce galaxy catalogues in three cosmologies based on the 
Wilkinson Microwave Anisotropy Probe (WMAP) first year, third 
year and seventh year results (Spergel et al. 2003: WMAP1; 
Spergel et al. 2007: WMAP3; Komatsu et al. 201 1: WMAP7). The 
model of Kang & van den Bosch (2008) slightly over-predicts the 
abundance of low-mass galaxies, and the parameters were tuned 
to fit the SMFs of Cole et al. (2001) and Bell et al. (2003). Here 
we modify it slightly by extending the gas cooling time in low- 
mass haloes to better match the low-mass end of the local SMF of 
Li & White (2009). By constructing a mock galaxy catalogue, we 
are able to determine what galaxies contribute to the small-scale 
clustering. We also investigate whether the current favored cosmo- 
logical parameters from the WMAP results, especially as, can be 
better constrained. 

The paper is organized as following: In Section 2, we outline 
how we construct the galaxy catalogue using merger trees from N- 
body simulations and the two methods discussed previously in the 
text, SAMs and the AMM. In Section 3, we present the predictions 
for the stellar mass clustering, 2PCF and their dependence on mass 
and colour. We focus on the differences between the two models 
and investigate the origin for the discrepancy between SAM and 
the data observed in previous studies. Finally, we conclude with a 
summary and discussion of our results in Section 4. 



2 BUILDING THE MOCK GALAXY CATALOGUES 

To predict the 2PCF and the stellar mass correlation, we need to 
produce a large sample of galaxies in a large cosmological volume. 
To achieve this, we build galaxy catalogues using N-body simu- 
lations of three different cosmologies. For each model galaxy, its 
stellar mass is determined in two ways. Model I is the slightly mod- 
ified semi-analytical model of K08, which is based on Kang et al. 
(2005, 2006). This model self-consistently models the physical pro- 
cesses governing stellar mass evolution, such as gas cooling, star 
formation, supernova and AGN feedback. Model II is the abun- 
dance matching method, which determines the stellar mass of each 
galaxy by using the observed SMF. These two methods are dis- 
cussed in detail below. 



2.1 Simulations & halo merger trees 

For both models we must first extract haloes from N-body simula- 
tions and determine their accretion history using a merger tree in 
order to "paint" luminous matter on to them. The N-body simu- 
lations were performed using the Gadget-2 code (Springel 2005). 
The three cosmologies are based on the WMAP1, WMAP3, and 
WMAP7 results, which mainly differing in the amplitude of power 
spectrum with erg. The amplitude of as for the WMAP1, WMAP3, 
and WMAP7 are 0.9,0.73 and 0.8 respectively. From here on, we 
will refer WMAP1 and WMAP7 as high— as cosmologies and 
WMAP3 as a low— erg cosmology. All simulations are run with 
1024 dark matter particles in a cube of 200 h~ 1 Mpc on each side. 

We briefly discuss the construction of halo merger trees here, 
for a more detailed discussion see Kang et al. (2005). At each snap- 
shot, dark matter haloes are identified using the Friends-of-Friends 
(FOF) algorithm. For each FOF halo we determine its virial radius, 
r v ir, defined as the radius centered on the most bound particle in- 
side of which the average density is A c (z) times the average den- 
sity of the universe (Bryan & Norman 1998). The mass inside r v i r 
is called as the virial mass m v i r . Inside each FOF halo, subhaloes 
are identified using the SUB FIND (Springel et al. 2001 ). Subhaloes 
are relics of haloes that have been accreted by a larger halo. Us- 
ing the FOF and subhalo catalogue, we can construct the (sub)halo 
merger trees and produce model galaxies inside each (sub)halo. 

The galaxies produced in this merger tree are called central 
galaxies if the galaxy is the largest galaxy at the center of each 
FOF halo. Any other galaxies associated with subhaloes are called 
satellites. For each satellite galaxy, we trace back to the time when 
it was a central galaxy of a FOF halo, and label the virial mass of 
the FOF halo as M aC c For central galaxy, the mass M a cc is the 
current virial mass of its FOF halo. This mass, M aC c, will later be 
used to determine the stellar mass in model II. 



2.2 Modeling galaxy stellar masses 

The main ingredients of the SAM used in this study are described 
in detail in Kang et al. (2005). These models allow one to easily 
investigate how galaxy properties vary as the underlying assump- 
tions regarding the baryonic physics are changed (e.g., Bower et al. 
2006; De Lucia & Blaizot 2007; Somerville et al. 2008; Guo et al. 
201 1). Although SAMs successfully reproduce a wide range of ob- 
servations, they have difficulty reproducing the shallow slope of 
SMF at the low-mass end. Recently Guo et al. (201 1) found that an 
enhanced supernova feedback and longer gas reincorporation time 
could reproduce the shallow slope measured from the SDSS DR7 
by Li & White (2009) down to a stellar mass of 1O 8 M . 

The model of Guo et al. (201 1) has the effect of decreasing the 
gas cooling rate, especially in low-mass haloes. Here we introduce 
a minor modification to the model of K08, to obtain a better match 
to the SMF. We parameterize the gas cooling in low mass haloes 
as. 

M cooi = f c * m hot /t dyn (1) 

where M coo i is the gas cooling rate onto the central galaxy in a 
halo, and mhot is the total gas content in that halo, and td yn is the 
dynamical time of the halo. In the K08 model, f c is effectively set 
to 1 which resulted in a steep slope at low-mass end. The parame- 
ters in K08 were normalized using the SMFs of Cole et al. (2001) 
and Bell et al. (2003). In this paper, we tune our model parameters 
to best match the SMF of Li & White (2009), which change the 
parameters only slightly. 
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Figure 1. The local stellar mass functions. The data points shows the 
Li & White (2009) results measured from the SDSS DR7. The red, blue 
and green solid lines are the predictions from model I for the WMAP1, 
WMAP7 and WMAP3 cosmologies with / c = 0.15. The dashed line is the 
result of K08 (/ c = 1.0) for the WMAP7 cosmology. Note that the turn off 
at lower mass (<~ 10 8 Mq) is due to the resolution of our simulations. 



In Fig. 1, we show the SMFs from our models and Li & White 

(2009) . Overall the observed SMF is well reproduced, though our 
models have an over abundance of massive galaxies (Af* > 3 x 
1O 11 M0). We will later see that this excess of massive galaxies is 
not the reason for the over-prediction of the SMCF. The solid lines 
are results with f c — 0.15 for the three cosmologies used in the 
paper, and the dashed line is the K08 result with f c — 1.0 for the 
WMAP7 cosmology. Compared to the K08 result, it is found that 
a lower f c can reduce the slope at small scales. Lowering f c fur- 
ther will decrease the faint-end slope, but it will also under-predict 
the abundance of galaxies around the characteristic stellar mass, 
M». Note that in our model gas which is heated by supernova feed- 
back is ejected from the central galaxy, but remains in the halo. In 
the model of Guo et al. (201 1), the fraction of gas ejected depends 
on the halo's potential, and this ejected gas is re-incorporated over 
longer time scales. This process is effectively included here by us- 
ing f c < 1. Note that at masses lower than ~ 10 8 Mq, our results 
are affected by the mass resolution of our simulations, which artifi- 
cially reduces the number of low mass galaxies. 

The second method we use to determine stellar mass is the 
AMM, originally proposed by Vale & Ostriker (2004). Observa- 
tions have shown that there is a tight scaling relation between 
galaxy properties and the host halo mass (e.g., Mandelbaum et al. 
2006). By assuming that there is a monotonic relation between 
galaxy stellar mass and its host halo mass at accretion (Af acc ), the 
stellar mass Af* can be determined by matching the mass func- 
tion of Macc of the model galaxy to the observed SMF, i.e. n(> 
Macc) = n(> Af, ). This match leaves a relation between the stel- 
lar mass and halo mass, called as Af, — Mh, a cc relation. Note that 
in this paper, we do not include any scatter in the Af* — Mh, aC c re- 
lation and also neglect any possible evolution with redshift. A full 
accounting of these effects is beyond the scope of this paper. We 
refer interested readers to Moster et al. (2010) and Behroozi et al. 

(2010) for discussions of these uncertainties. 
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Figure 2. The stellar mass and halo mass (M* — acc ) relation. Upper 
panel: the data points are from observation of Mandelbaum et al. (2006) 
and More et al. (2009), and the lines are predictions for the WMAP1 cos- 
mology, with dotted lines for model II and others for model I. Lower panel: 
the predicted relation for other cosmologies, as shown by their ratio to the 
WMAPl results. Here we have shifted the WMAP3 results by 1 for clarity. 



In Fig. 2, we show the matched Af* — Mh,acc relation. The 
upper panel shows the relations from the WMAPl cosmology, and 
the lower panel are for the WMAP3 and WMAP7 results, shown 
by their ratio to the WMAPl one. Note that the WMAP3 results 
are shifted by 1 for clarity. The solid lines are predictions from 
model I and dotted lines are the matched one from model II. In the 
upper panel, the observational results are from Mandelbaum et al. 
(2006) and More et al. (2009). Also shown are the relations for all 
satellite galaxies (red line), and satellites accreted at redshift z aC c > 
1 (magenta line), which we will now on refer to as early accreted 
satellites. 

Clearly, the two model predictions are very similar to the ob- 
servations except that model II over-predicts the halo mass for mas- 
sive galaxies. In model I, such a Af* — Alh.acc relation depends 
on whether a galaxy is a central galaxy or a satellite. For satellite 
galaxies, especially early accreted satellites, their host halo's mass 
at the time of accretion is lower relative to central galaxies. This is 
because the gas cooling and star formation efficiencies are higher 
at high redshift in the model. We will later see that these early ac- 
creted satellites now live in massive haloes, and it leads to a higher 
clustering at small scales. The lower panel shows that the predicted 
Af* — Mh,acc relation has little dependence on cosmology as we 
have set the model parameters to best fit the observed SMF. The 
resulting star formation efficiencies in given halo mass do not vary 
drastically between the three cosmologies analyzed. 
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Figure 3. The projected stellar mass correlations. Left panel: data points are from Li & White (2009). The colour lines are predictions from Model I (solid 
lines) and Model II (dotted lines), with red, blue and green colours referring to the WMAP1, WMAP7 and WMAP3 cosmologies. Right panel: the galaxy bias 
relative to the dark matter particles. Note that the results for the WMAP3 and WMAP7 cosmologies are shifted by factors of 10 and 3 for clarity. 
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Figure 4. The projected stellar mass correlations with dependence on galaxy type. Here only results from the WMAP1 cosmology are shown. Left panel: 
predictions from model I for central and satellite galaxies. Middle panel: the contribution of early accreted satellites (z acc > 1) to the total clustering. Right 
panel: the host halo mass of galaxies at z = in the two models. 



3 GALAXY CLUSTERING 

The traditional 2PCF only uses the position of each galaxy, and 
is thus incapable of constraining the properties of galaxy pairs at 
given distance. Li & White (2009) extended it by weighting each 
galaxy with its stellar mass, namely the stellar mass correlation 
function (SMCF). The projected SMCF is written as, 

£*(r p ,Tr)dn, (2) 

where £*(r p ,n) is the projected redshift-space correlation 
weighted by the product of the stellar masses in each pair, with 
distance of r p and tt. We compute the predicted SMCFs from the 
models in the same way as Li & White (2009). 

In the left panel of Fig. 3 we show the projected stellar mass 
correlation. The data points show the measurements of Li & White 
(2009) from the SDSS DR7. The solid, dotted lines show the pre- 
dictions from model I and model II, with red, blue and green colour 



referring to the WMAP1, WMAP7, and WMAP3 cosmologies re- 
spectively. It shows that model I over-predicts the SMCFs in all 
three cosmologies studied. The size of the discrepancy depends on 
the mass scale. At small masses the prediction is higher by a factor 
of 2 — 5 but at large masses the predicted SMCF is only 30% higher. 
The coloured lines show that lower as only slightly decreases the 
clustering amplitudes. The predictions from model II agree better 
with the data, from large scales to 0.3 ft _1 Mpc. At small scales, 
the predictions are only higher than the data by ~ 20%. 

In the right panel of Fig. 3, we show the galaxy bias, de- 
fined as the ratio of galaxy clustering relative to the clusering of 
underlying dark matter distribution, w* P / Wp" 1 , where w|> m is mea- 
sured directly from our simulations. Note that the results from the 
WMAP7 and WMAP3 cosmologies are shifted for clarity. On large 
scales, results from model II in the high as cosmologies fit the 
data equally well, but the WMAP3 cosmology is systematically 
lower. Li & White (2009) use the shape of galaxy bias to con- 
strain cosmology with the assumption that the galaxy bias is flat 
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Figure 5. The projected 2PCF of galaxies in different stellar mass bins. Data points are from Li & White (2009). Solid lines are for model I and dotted lines 
for model II. The dashed magenta lines show results from model I with the exclusion of early accreted satellites (z acc > 1). 



on large scales, and rising at small scales based on the model of 
De Lucia & Blaizot (2007). Indeed, we find that both models pre- 
dict similar shapes, indicating that the shape of bias is a generic 
prediction of galaxy formation models. 

To further explore the origin of the discrepancy of model I 
with the data, we show in Fig. 4 the clustering of galaxies of dif- 
ferent types. The left panel shows the SMCFs of centrals, satellites 
and early accreted satellites, respectively. The middle panel gives 
the contribution of early accreted satellites to the total SMCF. In the 
right panel we show the host halo mass at z = for galaxies with 
different types, also shown are the results from model II (dashed 
lines). 

The left and middle panels show that satellites, primarily early 
accreted satellites, are the main contribution to the clustering ampli- 
tude at small scales. By excluding these early accreted satellites, the 
clustering amplitude can be significantly suppressed at small scales 
(dashed line). The right panel shows that satellites primarily live in 
big haloes (> 10 13 M@) and those early accreted ones (z acc > 1) 
reside in even more massive haloes (M host > 10 14 M ). 

The results in the above figure are easy to understand. In the 
CDM universe, massive haloes grow by merger of small haloes at 
early times, and N-body simulations have shown (e.g., Gao et al. 
2004) that massive haloes accrete more low-mass haloes. The ma- 
genta line in Fig. 2 implies that star formation efficiency in low- 
mass haloes from model I is higher than that from model II. Conse- 
quently, for given stellar mass, the galaxies in model I live in more 
massive haloes. From the halo models (e.g., Mo & White 1996; 
Cooray & Sheth 2002), it is known that massive haloes are strongly 



clustered at all scales. The higher SMCFs from model I imply that 
the stellar mass of satellites are over-estimated, indicating that in 
the SAM either the star formation efficiency in low-mass haloes 
was too high at high redshift, or the tidal stripping is too inefficient 
for satellites. 

In Fig. 5, we show the projected 2PCFs of galaxies in different 
stellar mass bins for the two models. As in previous plots, the solid 
lines are for model I and dotted lines for model II. It is found that 
model I agrees with the data on all scales for very massive galax- 
ies (> 10 11 ' 27 Mq), but for less massive galaxies, the predictions 
agree with the data only on large scales. For the lowest mass bin 
(lgM* < 9.27) the predictions are higher on all scales. As noted 
by Guo et al. (2011), the low-mass galaxies in the SDSS DR7 are 
severely affected by a few structures and distorted by peculiar ve- 
locities. Accounting for these effects is beyond the scope of our 
paper, so we limit our discussion on galaxies with mass higher than 

10 9 ' 77 Mq. 

Remarkably, the predictions from model II are lower than that 
from model I and agree well with the data on all scales for galaxies 
with mass larger than 10 9 ' 77 Mq. Such an agreement was also re- 
cently shown by other studies (e.g., Moster et al. 2010). The most 
distinct improvement from model II is the suppression of cluster- 
ings on small scales. We have shown that this is because galaxies 
are on average living in low-mass haloes in model II. As a test, 
we show predictions with the exclusion of early accreted satellites 
(zacc > 1) in model I using the dashed magenta lines. As seen from 
Fig. 4 the early accreted satellites dominate the power on small 
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Figure 6. The projected 2PCFs of red and blue galaxies in different stellar mass bins. The solid lines are for model I and dotted ones for model II, with red 
and blue lines for red and blue galaxies, respectively. For clarity here only predictions from the WMAP1 cosmology are shown, as the other two cosmologies 
give similar results. 
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Figure 7. The stellar mass correlation function at z = 1. The right panel is for the bias. The predictions from model II have used the local M» — acc 
relation (dotted line in the left panel) and the evolved one from Moster et al. (2010) (dashed lines in the middle panel). 



scales, neglecting them significantly improves the agreement be- 
tween model I and the observations. 

The excess of clustering on small scales from the SAMs has 
already been well known. It was recently suggested that a low ag 
universe may provide better fit to the clustering on small scales, as 
such a cosmology contains fewer massive haloes (e.g., Guo et al. 
201 1). Here we find that predicted galaxy clustering from the SAM 
is only slightly suppressed in the low— erg (WMAP3) universe and 
still lies above the data. Actually, we find that using model II with 



a high as ~ 0.9 (WMAP1) cosmology provides slightly better fit 
to the data than the other two cosmologies. Our results indicate 
that the model of determing galaxy mass plays the dominate role 
in the predicted clusterings. Consequently, due to our poor under- 
standing of galaxy formation physics, we must rely on other obser- 
vations, such as cosmic microwave background or weak lensing, 
which have little dependence on small-scale baryonic processes, to 
provide tight constraints on cosmological parameters. 

Now we consider the dependence of galaxy clustering on 



colour. The unobscured galaxy color is calculated using the lu- 
minosity in multiple wavebands. These luminosities are calculated 
by combining the star formation history with the stellar popula- 
tion synthesis models, and both are self-consistently modeled in 
our model I. We classify a galaxy as red or blue using the color- 
magnitude relation from Li et al. (2006). As model II does not pre- 
dict galaxy colour and to study the colour dependence in this model, 
we make a naive assumption that the galaxy has the same color as 
its counterpart in model I. This assumption is probably true as we 
know that a galaxy's colour is mainly affected by the merger his- 
tory of its host halo. For example, most satellites in model I are red 
because they were accreted at early times and their current star for- 
mation rates are very low. These objects are also expected to be red 
in model II as their stellar mass is set by the host halo mass at accre- 
tion, which implies that satellites experience no star formation once 
they are accreted. Most centrals are blue in model I as they undergo 
continuous gas cooling and star formation. They are also expected 
to be blue in model II as their stellar mass are determined by the 
current halo mass, again imply continuous star formation (see the 
good agreement of M s — Alh,acc relation in Fig 2 for centrals in 
both models). 

In Fig. 6 we show the predicted 2PCFs from model I (solid 
lines) and model II (dotted lines). Here only results from the 
WMAP1 cosmology are shown as the other cosmologies produce 
similar results. The red and blue lines are for red and blue galaxies, 
respectively. Quantitively, model I reproduces the colour depen- 
dence seen in the data, that is red galaxies in each mass bin are more 
clustered than blue ones. The predicted clustering of blue galaxies 
is in better agreement with the data than that of red galaxies. Model 
II provides better match to the data due to the suppression of clus- 
tering of red galaxies. The fact that we can reproduce the colour 
dependence indicates that the SAMs have correctly modeled the 
main physics governing galaxy colour, such as AGN feedback in 
massive centrals and tidal strangulation of hot halo gas from satel- 
lites. 



3.1 Stellar mass clustering at z = 1 

From the above, we have seen that model II provides better match 
to the data than model I at z = 0. Here we investigate how the 
predictions change with redshift, and in particular, we check if the 
clustering properties at z = 1 can place constraints on the used 
cosmologies. To obtain the stellar mass of model galaxy in model 
II at z = 1, we use both the local M* — Mh,acc relation in Fig. 2, 
and the one given by Moster et al. (2010) who obtained it by fitting 
the SMFs at z = 1. The results are shown in Fig. 7, where the data 
points of the DEEP2 are from Li et al. (201 1). 

Seen from the solid lines in the left panel that the SMCF from 
model I is still excessively clustered on small scales, and a low 
ag universe still can not resolve this discrepancy. The dotted lines 
in the left panel shows that model II with the local M s — Mh,acc 
relation can well reproduce the clustering at all scales, and the pre- 
dictions are identical for all the three cosmological models. Using 
the M s —Mh,acc relation of Moster et al. (2010) increases the clus- 
tering slightly but the results are still acceptable. This suggests that 
the stellar mass to halo mass relation is almost in place at z = 1, in- 
dicating that the tidal stripping effect for satellites is not significant 
at z < 1. The right panel shows the stellar mass bias, which again 
has a smooth transition that it is flat on large scales and growing on 
small scales, similar to the one in Fig. 3. This behaviour is indepen- 
dent of redshift, further supporting the conclusion of Li & White 
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(2009) that this form of galaxy bias is a generic prediction of galaxy 
formation models. 



4 CONCLUSIONS AND DISCUSSIONS 

In this paper, we have studied the stellar mass clustering and the 
projected two-point correlation functions of galaxies, and com- 
pared the model predictions to data from the SDSS (z — 0) and 
the DEEP2 (z = 1) (Li & White 2009; Li et al. 201 1). In order 
to explore the feasibility of constraining cosmological parameters 
using these quantities, we ran N-body simulations with three differ- 
ent cosmologies based on the WMAP1, WMAP3, WMAP7 results, 
which mainly differ in ag values (0.9, 0.73, 0.81 respectively). We 
then populated the simulations with model galaxies, and determine 
their stellar mass using two different approaches: model I is a the 
Semi-Analytical Model, which self-consistently models the physics 
of star formation, model II is an empirical model where stellar 
masses are obtained by matching the observed stellar mass func- 
tion. We have obtained a few interesting results listed below. 

• The stellar mass clustering function predicted by the semi- 
analytical model is excessively clustered on small scales, and is 
still about 30% higher even at larger scales. The projected two- 
point correlation agrees with the data on large scales but is still too 
high on small scales. These results imply that the excess clustering 
is from an over-prediction of the stellar mass of satellites in massive 
haloes. We further found that this excess is mainly from satellites 
accreted at early times. These galaxies at z = live in massive 
haloes larger than 1O 14 M0 and as a result are strongly clustered. 
We also found that a low ag universe (ag = 0.8, 0.73) will only 
slightly decrease the small-scale clustering, but the discrepancies 
with observational data remain. 

• The abundance matching model provides a much better fit to 
the data at both z — and z = 1 than the semi-analytical model. 
This improvement is primarily due to the suppression of the num- 
ber of low mass satellite galaxies residing in massive haloes. We 
found that the WMAP1 and WMAP7 cosmologies (ag = 0.9, 0.8) 
both provide acceptable fit to the data, but the WMAP3 cosmology 
(ag — 0.73) predicts a galaxy clustering on large scales at z = 
that is lower than the observed one. 

• Qualitatively, the colour dependence of clustering is repro- 
duced from model I, with the over-prediction of clustering of red 
galaxies on small scales. The predicted clusterings of blue galax- 
ies agree better with the data. By suppressing the stellar mass of 
satellites in model II, the colour dependence of clustering is well 
reproduced in this model. These results indicate that the SAMs can 
marginally capture the main physics governing galaxy colour. Here 
we note that the colour of galaxy is more sensitive to the recent star 
formation history, thus being not capable of constraining its full star 
formation, especially that at early times. This is not in conflict with 
the conclusion that semi-analytical model over-predicts the mass of 
satellite galaxies. 

• The galaxy bias, defined as the ratio of galaxy clustering to 
that of the dark matter distribution, has a general form in both mod- 
els. That is, it is flat on large scales and rises at small scales without 
any strong transitional feature. The fact that this form appears in- 
dependent of galaxy formation model supports the argument used 
by Li & White (2009), who used this general shape to constrain 
cosmology parameter of erg. 

Overall, we find that the galaxy clustering is stronger affected 
by the model for galaxy formation than the adopted cosmology, 
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thus it is currently impossible to accurately constrain the cosmo- 
logical parameters. By comparing the predictions of the models in 
our paper, we find that the semi-analytical model over-estimates the 
stellar mass of satellites. This over-prediction indicates that either 
the star formation efficiency in low-mass haloes at high redshifts 
is too high or that the tidal stripping of satellites is too inefficient. 
Furthermore, our results at z = 1 shows that the stellar mass to 
halo mass relation is almost in place by z = 1, suggesting that tidal 
stripping at z < 1 should be weak. These results point to the fact 
that some other mechanisms, such as QSO feedback, stronger tidal 
stripping at high redshifts, should be incorporated into the current 
galaxy formation models to remove the discrepancy between S AMs 
and observational data. These process are expected to be important 
at z > 1 as QSO activities and galaxy interactions are much more 
common in the past. 
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